[AArch64] Set the cache line size to 64 for the V2 and V3. #148213

Open
wants to merge 1 commit into main

Conversation

sjoerdmeijer
Collaborator

This sets the cache line size to 64 for the Neoverse V2 and V3. I've tested this with loop-interchange: it doesn't increase compile times, but it does enable a lot more interchange.

I've also set this for the V3. I have not tested that, but it seems like the sensible thing to do; I'm happy to remove it.

@llvmbot
Member

llvmbot commented Jul 11, 2025

@llvm/pr-subscribers-backend-aarch64

Author: Sjoerd Meijer (sjoerdmeijer)

Changes

This sets the cache line size to 64 for the Neoverse V2 and V3. I've tested this with loop-interchange: it doesn't increase compile times, but it does enable a lot more interchange.

I've also set this for the V3. I have not tested that, but it seems like the sensible thing to do; I'm happy to remove it.


Full diff: https://github.com/llvm/llvm-project/pull/148213.diff

1 file affected:

  • (modified) llvm/lib/Target/AArch64/AArch64Subtarget.cpp (+1)
diff --git a/llvm/lib/Target/AArch64/AArch64Subtarget.cpp b/llvm/lib/Target/AArch64/AArch64Subtarget.cpp
index 68ed10570a52f..4ac93526295aa 100644
--- a/llvm/lib/Target/AArch64/AArch64Subtarget.cpp
+++ b/llvm/lib/Target/AArch64/AArch64Subtarget.cpp
@@ -270,6 +270,7 @@ void AArch64Subtarget::initializeProperties(bool HasMinSize) {
     break;
   case NeoverseV2:
   case NeoverseV3:
+    CacheLineSize = 64;
     EpilogueVectorizationMinVF = 8;
     MaxInterleaveFactor = 4;
     ScatterOverhead = 13;

@kasuga-fj
Contributor

it doesn't increase compile times, but it does enable a lot more interchange.

Exactly, and that ended up exposing another correctness issue...
Sorry, I hadn't really looked at the details of the applied interchanges, so I totally missed it. I just noticed it a moment ago.

Contributor

@fhahn fhahn left a comment


it doesn't increase compile times, but it does enable a lot more interchange.

Exactly, and that ended up exposing another correctness issue... Sorry, I hadn't really looked at the details of the applied interchanges, so I totally missed it. I just noticed it a moment ago.

For checking correctness of LoopInterchange, it would probably be good to test with interchanging all loops that are considered legal, ignoring the cost model.

@kasuga-fj
Contributor

it doesn't increase compile times, but it does enable a lot more interchange.

Exactly, and that ended up exposing another correctness issue... Sorry, I hadn't really looked at the details of the applied interchanges, so I totally missed it. I just noticed it a moment ago.

For checking correctness of LoopInterchange, it would probably be good to test with interchanging all loops that are considered legal, ignoring the cost model.

That makes perfect sense to me.

Contributor

@kasuga-fj kasuga-fj left a comment


Regarding this change, I think it's fine as long as it follows the specifications of the Neoverse V2 and V3 (though I'm not familiar with them). However, I think it would be better to merge this after the correctness issues in LoopInterchange are resolved, to avoid trouble.

@sjoerdmeijer
Collaborator Author

@kasuga-fj: thanks for the report. Shall I look at that miscompilation, since you're probably busy with the delinearization?

@fhahn: yeah, that makes sense. I am going to do some more testing. I might put up a little patch introducing an internal option to ignore the cost model; I can see how that would be useful for testing and experimentation.
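
A minimal sketch of what such a hidden option could look like; the flag name and its wiring into the pass are hypothetical, not an existing LLVM option:

// Hypothetical testing flag for LoopInterchange; the name below is
// illustrative only.
#include "llvm/Support/CommandLine.h"

static llvm::cl::opt<bool> IgnoreProfitability(
    "loop-interchange-ignore-profitability", llvm::cl::init(false),
    llvm::cl::Hidden,
    llvm::cl::desc("Interchange every legal loop nest, bypassing the cost "
                   "model (testing only)"));

// The profitability check in the pass would then be guarded roughly as:
//   if (!IgnoreProfitability && !isProfitable(/* ... */))
//     return false; // keep the original loop order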

@kasuga-fj
Contributor

@kasuga-fj: thanks for the report. Shall I look at that miscompilation, since you're probably busy with the delinearization?

I've probably got a fix in mind, so I can take care of it next week if that's okay. But if it's urgent, feel free to take it over.

@sjoerdmeijer
Collaborator Author

@kasuga-fj: thanks for the report. Shall I look at that miscompilation, since you're probably busy with the delinearization?

I've probably got a fix in mind, so I can take care of it next week if that's okay. But if it's urgent, feel free to take it over.

Sure, go ahead if you already have a way of fixing this.

Collaborator

@davemgreen davemgreen left a comment


64 sounds good if this does something useful nowadays. It could probably be the default for all CPUs.

@sjoerdmeijer
Collaborator Author

FYI: I have kicked off different test jobs and am running different fuzzers over the weekend. I am testing this with interchange enabled, the cache line size set to 64, and the cost model disabled, to exercise interchange as much as possible, as suggested. I have already found one unrelated issue, #133922, but will report back next week on the interchange results.

@sjoerdmeijer
Collaborator Author

64 sounds good if this does something useful nowadays. It could probably be the default for all CPUs.

I think so too. The default of 0 doesn't make an awful lot of sense.
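
A rough sketch of what that follow-up could look like (not part of this PR, and only an assumption about where the default would live): initialize the field before the per-CPU switch in AArch64Subtarget::initializeProperties(), so that 64 becomes the AArch64-wide default and individual CPUs can still override it.

// Hypothetical follow-up, for illustration only: a 64-byte default for all
// AArch64 subtargets, set before the per-CPU switch.
void AArch64Subtarget::initializeProperties(bool HasMinSize) {
  CacheLineSize = 64; // assumed default; specific CPUs below may still override
  switch (ARMProcFamily) {
  // ... existing per-CPU cases unchanged ...
  }
}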

5 participants